Distilling the Knowledge in a Neural Network

نویسندگان

Geoffrey E. Hinton

Oriol Vinyals

Jeffrey Dean

چکیده

A very simple way to improve the performance of almost any machine learning algorithm is to train many different models on the same data and then to average their predictions [3]. Unfortunately, making predictions using a whole ensemble of models is cumbersome and may be too computationally expensive to allow deployment to a large number of users, especially if the individual models are large neural nets. Caruana and his collaborators [1] have shown that it is possible to compress the knowledge in an ensemble into a single model which is much easier to deploy and we develop this approach further using a different compression technique. We achieve some surprising results on MNIST and we show that we can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model. We also introduce a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse. Unlike a mixture of experts, these specialist models can be trained rapidly and in parallel.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neuron Mathematical Model Representation of Neural Tensor Network for RDF Knowledge Base Completion

In this paper, a state-of-the-art neuron mathematical model of neural tensor network (NTN) is proposed to RDF knowledge base completion problem. One of the difficulties with the parameter of the network is that representation of its neuron mathematical model is not possible. For this reason, a new representation of this network is suggested that solves this difficulty. In the representation, th...

متن کامل

A comparison between knowledge-driven fuzzy and data-driven artificial neural network approaches for prospecting porphyry Cu mineralization; a case study of Shahr-e-Babak area, Kerman Province, SE Iran

The study area, located in the southern section of the Central Iranian volcano–sedimentary complex, contains a large number of mineral deposits and occurrences which is currently facing a shortage of resources. Therefore, the prospecting potential areas in the deeper and peripheral spaces has become a high priority in this region. Different direct and indirect methods try to predict promising a...

متن کامل

An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...

متن کامل

Mapping Dieback Intensity Distribution in Zagros Oak Forests Using Geo-statistics and Artificial Neural Network

The first and most important issue in forest drought management is knowledge of the location and severity of forest decline. In this regard, we used geostatistics and artificial neural network methods to map the dieback intensity of oak forests in the Ilam province, Iran. We used a systematic random sampling with a 250 × 200 m grid to establish 100 plots, each covering 1200 m2. The percentage ...

متن کامل

Classification of Iranian traditional musical modes (DASTGÄH) with artificial neural network

The concept of Iranian traditional musical modes, namely DASTGÄH, is the basis for the traditional music system. The concept introduces seven DASTGÄHs. It is not an easy process to distinguish these modes and such practice is commonly performed by an experienced person in this field. Apparently, applying artificial intelligence to do such classification requires a combination of the basic infor...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1503.02531 شماره

صفحات -

تاریخ انتشار 2015

Distilling the Knowledge in a Neural Network

نویسندگان

چکیده

منابع مشابه

Neuron Mathematical Model Representation of Neural Tensor Network for RDF Knowledge Base Completion

A comparison between knowledge-driven fuzzy and data-driven artificial neural network approaches for prospecting porphyry Cu mineralization; a case study of Shahr-e-Babak area, Kerman Province, SE Iran

An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

Mapping Dieback Intensity Distribution in Zagros Oak Forests Using Geo-statistics and Artificial Neural Network

Classification of Iranian traditional musical modes (DASTGÄH) with artificial neural network

عنوان ژورنال:

اشتراک گذاری